RAW SEQUENCE LISTING 
ERROR REPORT 



The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 

Application Serial Number: 

Source: / C)[Pt ' ~ 

Date Processed by STIC: Z3 -Q*2*~ 

THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 

FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216. 
PATENTIN 2.1 e-mail help: patin21help@uspto.gov or phone 703-306-4119 (R. Wax) 
PATENTIN 3.0 e-mail help: patin3help@uspto.gov or phone 703-306-4119 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER
VERSION 3.1 PROGRAM, ACCESSIBLE THROUGH THE U.S. PATENT AND
TRADEMARK OFFICE WEBSITE. SEE BELOW FOR ADDRESS:
http://www.uspto.gov/web/offices/pac/checker

Applicants submitting genetic sequence information electronically on diskette or CD-Rom should be aware that there is
a possibility that the disk/CD-Rom may have been affected by treatment given to all incoming mail.
Please consider using alternate methods of submission for the disk/CD-Rom or replacement disk/CD-Rom.

Any reply including a sequence listing in electronic form should NOT be sent to the 20231 zip code address for the
United States Patent and Trademark Office, and instead should be sent via the following to the indicated addresses:

1. EFS-Bio (<http://www.uspto.gov/ebc/efs/downloads/documents.htm>, EFS Submission
User Manual - ePAVE)

2. U.S. Postal Service: U.S. Patent and Trademark Office, Box Sequence, P.O. Box 2327, Arlington, VA 22202 

3. Hand Carry directly to: 

U.S. Patent and Trademark Office, Technology Center 1600, Reception Area, 7th Floor, Examiner Name,
Sequence Information, Crystal Mall One, 1911 South Clark Street, Arlington, VA 22202
Or
U.S. Patent and Trademark Office, Box Sequence, Customer Window, Lobby, Room 1B03, Crystal Plaza Two,
2011 South Clark Place, Arlington, VA 22202

4. Federal Express, United Parcel Service, or other delivery service to: U.S. Patent and Trademark Office,
Box Sequence, Room 1B03-Mailroom, Crystal Plaza Two, 2011 South Clark Place, Arlington, VA 22202

Raw Sequence Listing Error Summary
Does Not Comply
Corrected Diskette Needed



ERROR DETECTED SUGGESTED CORRECTION 



SERIAL NUMBER: 09/938,035A



ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE 



1   Wrapped Nucleics / Wrapped Aminos
    The number/text at the end of each line "wrapped" down to the next line. This may occur if your file
    was retrieved in a word processor after creating it. Please adjust your right margin to .3; this will
    prevent "wrapping."

2   Invalid Line Length
    The rules require that a line not exceed 72 characters in length. This includes white spaces.



3   Misaligned Amino Numbering
    The numbering under each 5th amino acid is misaligned. Do not use tab codes between numbers;
    use space characters instead.

4   Non-ASCII
    The submitted file was not saved in ASCII (DOS) text, as required by the Sequence Rules. Please
    ensure your subsequent submission is saved in ASCII text.
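
The formatting rules above (72-character line limit, no tab codes, ASCII text only) can be checked before a file is submitted. The following Python sketch is illustrative only and is not the USPTO Checker program; the function name `check_listing_lines` and the wording of the problem messages are assumptions made for this example.

```python
MAX_LINE_LENGTH = 72  # limit stated in the "Invalid Line Length" item


def check_listing_lines(raw: bytes) -> list[tuple[int, str]]:
    """Return (line_number, problem) pairs for the bytes of a sequence listing file.

    Hypothetical helper: it only covers line length, tab characters and
    non-ASCII bytes, not the other items on the error summary.
    """
    problems = []
    for number, line in enumerate(raw.splitlines(), start=1):
        # White space counts toward the 72-character limit.
        if len(line) > MAX_LINE_LENGTH:
            problems.append((number, "line longer than 72 characters"))
        # Tab codes misalign the numbering under the residues.
        if b"\t" in line:
            problems.append((number, "tab character; use spaces"))
        # Any byte above 0x7F means the file is not plain ASCII text.
        if any(byte > 0x7F for byte in line):
            problems.append((number, "non-ASCII byte"))
    return problems
```

Reading the file in binary mode, for example `check_listing_lines(open(path, "rb").read())`, avoids decoding errors hiding a non-ASCII byte.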



5   Variable Length
    Sequence(s) ____ contain n's or Xaa's representing more than one residue. Per Sequence Rules,
    each n or Xaa can only represent a single residue. Please present the maximum number of each
    residue having variable length and indicate in the <220>-<223> section that some may be missing.



6   PatentIn 2.0 "bug"
    A "bug" in PatentIn version 2.0 has caused the <220>-<223> section to be missing from amino acid
    sequence(s) ____. Normally, PatentIn would automatically generate this section from the
    previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to
    the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for
    Artificial or Unknown sequences.



7   Skipped Sequences (OLD RULES)
    Sequence(s) ____ missing. If intentional, please insert the following lines for each skipped sequence:

    (2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X" is shown)
    (i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading)
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:X: (insert SEQ ID NO where "X" is shown)
    This sequence is intentionally skipped

    Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to include the skipped sequences.

8   Skipped Sequences (NEW RULES)
    Sequence(s) ____ missing. If intentional, please insert the following lines for each skipped sequence:

    <210> sequence id number
    <400> sequence id number
    000
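
As an illustration of the new-rules placeholder above, the three lines for a skipped sequence could be generated as follows. This is a minimal Python sketch; the function name `skipped_sequence_entry` and the single space after each numeric identifier are assumptions, not part of the Sequence Rules text quoted here.

```python
def skipped_sequence_entry(seq_id: int) -> str:
    """Build the three-line placeholder for an intentionally skipped sequence.

    Hypothetical helper: emits <210> and <400> with the sequence id number,
    followed by the literal "000" line described in the form.
    """
    return f"<210> {seq_id}\n<400> {seq_id}\n000\n"
```

For a listing in which SEQ ID NO 2 is intentionally skipped, `skipped_sequence_entry(2)` yields the lines `<210> 2`, `<400> 2` and `000`.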



9   Use of n's or Xaa's (NEW RULES)
    Use of n's and/or Xaa's has been detected in the Sequence Listing.
    Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's are present.
    In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents.

10  Invalid <213> Response
    Per 1.823 of Sequence Rules, the only valid <213> responses are: Unknown, Artificial Sequence, or
    scientific name (Genus/species). <220>-<223> section is required when <213> response is Unknown or
    is Artificial Sequence.

11  Use of <220>
    Sequence(s) ____ missing the <220> "Feature" and associated numeric identifiers and responses.
    Use of <220> to <223> is MANDATORY if <213> "Organism" response is "Artificial Sequence" or
    "Unknown." Please explain source of genetic material in <220> to <223> section.
    (See "Federal Register," 06/01/1998, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules)

12  PatentIn 2.0 "bug"
    Please do not use "Copy to Disk" function of PatentIn version 2.0. This causes a corrupted file,
    resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence
    listing). Instead, please use "File Manager" or any other manual means to copy file to floppy disk.
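
The <220>-<223> requirements above (mandatory when n or Xaa is present, or when the <213> response is "Unknown" or "Artificial Sequence") can be pre-checked with a short script. The Python sketch below is a rough illustration under stated assumptions: it assumes a new-rules listing in which each sequence starts with a `<210>` line and its residues follow the `<400>` line, it treats `<212>` value `PRT` as protein, and the function name `entries_missing_feature` is hypothetical. It does not verify that the <223> text actually explains the n or Xaa.

```python
import re

NEEDS_EXPLANATION = {"unknown", "artificial sequence"}


def entries_missing_feature(listing: str) -> list[int]:
    """Return SEQ ID numbers that need a <220>-<223> section but have none."""
    missing = []
    # Split into per-sequence blocks, each beginning at a <210> line.
    for block in re.split(r"(?m)^(?=<210>)", listing):
        header = re.match(r"<210>[ \t]*(\d+)", block)
        if not header:
            continue  # general information before the first sequence
        head, _, body = block.partition("<400>")
        # Drop the rest of the <400> line; what remains are residue lines.
        residues = body.split("\n", 1)[1] if "\n" in body else ""

        organism = re.search(r"(?m)^<213>[ \t]*(.+)$", head)
        name = organism.group(1).strip().lower() if organism else ""
        seq_type = re.search(r"(?m)^<212>[ \t]*(\S+)", head)
        is_protein = seq_type is not None and seq_type.group(1).upper() == "PRT"

        if is_protein:
            uses_placeholder = "Xaa" in residues
        else:
            letters = re.sub(r"[^A-Za-z]", "", residues)
            uses_placeholder = "n" in letters.lower()

        if (uses_placeholder or name in NEEDS_EXPLANATION) and "<220>" not in head:
            missing.append(int(header.group(1)))
    return missing
```

Protein and nucleotide sequences are handled separately because three-letter amino acid codes such as Asn contain the letter n without being placeholders.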



AMC - Biotechnology Systems Branch - 06/04/2001
Does Not Comply
Corrected Diskette Needed




OIPE 



RAW SEQUENCE LISTING                          DATE: 05/23/2002
PATENT APPLICATION: US/09/938,035A            TIME: 18:06:05

Input Set : A:\seq list-20731, RocheVit.txt
Output Set: N:\CRF3\05232002\I938035A.raw

       3 <110> APPLICANT: Roche Vitamins AG
       5 <120> TITLE OF INVENTION: Microbial process for producing L-ascorbic acid and
       6       D-erythorbic acid
       8 <130> FILE REFERENCE: Alicyclobacillus NA20, 21, FJ21 16S nuc
C-->  10 <140> CURRENT APPLICATION NUMBER: US/09/938,035A
C-->  11 <141> CURRENT FILING DATE: 2001-08-23
      13 <150> PRIOR APPLICATION NUMBER: EP Application No. 00118059.5
      14 <151> PRIOR FILING DATE: 2000-08-23
      16 <160> NUMBER OF SEQ ID NOS: 3
      18 <170> SOFTWARE: PatentIn Ver. 2.1



ERRORED SEQUENCES

      98 <210> SEQ ID NO: 3
      99 <211> LENGTH: 1495
     100 <212> TYPE: DNA
     101 <213> ORGANISM: Alicyclobacillus sp
         <220> FEATURE:
         <221> NAME/KEY: rRNA
         <222> LOCATION: (1)..(1495)
         <223> OTHER INFORMATION: FJ-21 Partial 16S rRNA gene sequence
         <400> SEQUENCE: 3
     109 aggacgaacg ctggcggcgt gcctaataca tgcaagtc gcggaccfefct tctgaggtca 60
     110 gcggcggacg ggtgaggaac acgtgggtaa tctgcctt agacccpgaat aacgcccgga 120
     111 aacgggcgct aatgccggat acgcccgcga ggaggca££t tcttgcgggg aaaggcccga 180
W--> 112 ttgggccgct gagagaggag cccgcggcgc attagc tggcggggta acggcccacc 240
     113 aaggcgacga tgcgtagccg acctgagagg gtgaccg^cc factgggac tgagacacgg 300
     114 cccagactcc tacgggaggc agcagtaggg aatcttccgc latgggcgca agcctgacgg 360
     115 agcaacgccg cgtgagcgaa gaaggccttc gggttgta gctctgttgc tcggggagag 420
     116 cggcatgggg agtggaaagc cccatgcgag acggtacc/ga gtgaggaagc cccggctaac 480
     117 tacgtgccag cagccgcggt aaaacgtagg gggcgagcgt tgtccggaat cactgggcgt 540
     118 aaagggtgcg taggcggtcg agcaagtctg gagttfaaagt ccatggctca accatgggat 600
     119 ggctctggaa actgcttgac ttgagtgctg gagaggcaag gggaattcca cgtgtagcgg 660
     120 tgaaatgcgt agagatgtgg aggaatacca rggcgaagg cgccttgctg gacagtgact 720
     121 gacgctgagg cacgaaagcg tggggagcaa :aggattag ataccctggt agtccacgcc 780
     122 gtaaacgatg agtgctaggt gttgggggg cacaccccag tgccgaagga aacccaataa 840
     123 gcactccgcc tggggagtac ggtcgcaa^a ctgaaactca aaggaattga cgggggcccg 900
     124 cacaagcagt ggagcatgtg gtttaa^tcg aagcaacgcg aagaacctta ccagggcttg 960
     125 acatccctct gacgggtgca gagatzgcacc ttcccttcgg ggcagaggag acaggtggtg 1020
     126 catggttgtc gtcagctcgt gt^c^tgagat gttgggttca gtcccgcaac gagcgcaacc 1080
W--> 127 cttgacctgt gttaccagcg cgnuanggcg gggactcaca ggtgactgcc ggcgtaagtc 1140
     128 ggaggaaggc ggggatgacg tcaaatcatc atgcccctga tgtcctgggc tacacacgtg 1200


     129 ctacaatggg cggtacaaag ggaggcgaag ccgcgaggcg gagcgaaacc caaaaagccg 1260
     130 ctcgtagttc ggattgcagg ctgcaactcg cctgcatgaa gccggaattg ctagtaatcg 1320
     131 cggatcagca tgccgcggtg aatacgttcc cgggccttgt acacaccgcc cgtcacacca 1380
     132 cgagagtcgg caacacccga agtcggtgag gtaaccccga aaggggagcc agccgccgaa 1440
     133 ggtggggtcg atgattgggg tgaagtcgta acaaggtagc cgtaccggaa ggtgc 1495
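
Each sequence row in the listing ends with a running total of bases (60, 120, ... 1495), so a row whose total disagrees with the bases actually present can be detected mechanically. The Python sketch below shows one way to do that; it is not the validator that produced this printout, and the function name `check_running_counts` and its `start` parameter are assumptions made for the example.

```python
def check_running_counts(rows: list[str], start: int = 0) -> list[int]:
    """Return indexes of rows whose trailing count disagrees with the bases seen.

    Hypothetical helper: each row is expected to hold whitespace-separated
    groups of bases followed by a cumulative count. `start` is the number of
    bases that precede the first row passed in.
    """
    bad = []
    total = start
    for index, row in enumerate(rows):
        parts = row.split()
        if not parts:
            continue  # blank line between rows
        if not parts[-1].isdigit():
            bad.append(index)  # row carries no running count at all
            total += sum(len(group) for group in parts)
            continue
        *groups, count = parts
        total += sum(len(group) for group in groups)
        if int(count) != total:
            bad.append(index)
            total = int(count)  # resynchronise so one bad row is reported once
    return bad
```

A row that contains only a number and no bases is reported as well, since its count cannot match the preceding total.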



E--> 136 1
E--> l/9 1

VERIFICATION SUMMARY                          DATE: 05/23/2002
PATENT APPLICATION: US/09/938,035A            TIME: 18:06:06

Input Set : A:\seq list-20731, RocheVit.txt
Output Set: N:\CRF3\05232002\I938035A.raw

L:10 M:270 C: Current Application Number differs, Replaced Application Number
L:11 M:271 C: Current Filing Date differs, Replaced Current Filing Date



L:34 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:1 after pos.:180
L:47 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:1 after pos.:960
L:54 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:1 after pos.:1380
L:73 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:2 after pos.:180



L:112 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:3 after pos.:180
L:127 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:3 after pos.:1080
L:136 M:254 E: No. of Bases conflict, this line has no nucleotides.

M:254 Repeated in SeqNo=3 
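
For tracking corrections, the summary entries above can be read into structured records. The Python sketch below is a minimal illustration; the `Finding` type, the pattern and the function name `parse_findings` are assumptions, and the reading of the letter codes (C for a value the PTO software corrected, W for a warning, E for an error) is inferred from this printout rather than taken from official documentation.

```python
import re
from typing import NamedTuple


class Finding(NamedTuple):
    line: int          # line number in the raw sequence listing
    message_code: int  # the "M:" number
    severity: str      # "C", "W" or "E" (meaning assumed, see lead-in)
    text: str          # remainder of the entry


# Tolerates small variations such as "M.254" or a missing colon after the letter.
PATTERN = re.compile(r"^L[:\s-]?\s*(\d+)\s+M[:.]\s*(\d+)\s+([CWE]):?\s*(.*)$")


def parse_findings(summary: str) -> list[Finding]:
    """Extract one Finding per 'L:... M:...' entry; other lines are ignored."""
    findings = []
    for raw in summary.splitlines():
        match = PATTERN.match(raw.strip())
        if match:
            line, code, severity, text = match.groups()
            findings.append(Finding(int(line), int(code), severity, text))
    return findings
```

Continuation lines without a line number, such as "M:254 Repeated in SeqNo=3", are skipped by this sketch.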